Parameter estimation of quantum processes 
using convex optimization 



Research report 



Gabor Ballo 

Department of Electrical Engineering and Information Systems, 
University of Pannonia, 
Veszprem, Hungary 

Katalin M. Hangos 

Process Control Research Group, 
Computer and Automation Research Institute, 
Budapest, Hungary 

April 30, 2010 



Abstract 

A convex optimization based method is proposed for quantum process to- 
mography, in the case of known channel model structure, but unknown channel 
parameters. The main idea is to select an affine parametrization of the Choi 
matrix as a set of optimization variables, and formulate a semidefinite pro- 
gramming problem with a least squares objective function. Possible convex 
relations between the optimization variables are also taken into account to 
improve the estimation. 

Simulation case studies show, that the proposed method can significantly 
increase the accuracy of the parameter estimation, if the channel model struc- 
ture is known. Beside the convex part, the determination of the channel pa- 
rameters from the optimization variables is a nonconvex step in general. In 
the case of Pauli channels however, the method reduces to a purely convex 
optimization problem, allowing to obtain a globally optimal solution. 
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1. INTRODUCTION 



1 Introduction 

In these days, the building of quantum computers that can be used to solve 
realistic, large scale problems — including the breaking of cryptographic codes and 
the simulation of complex quantum systems — has two main difficulties from the 
theoretical point of view. The first is decoherence, in other words the unavoidable 
coupling of quantum systems with their environment, which leads to the altering or 
even the complete destruction of the quantum system's state, thus causing errors in 
the calculation. The other main obstacle is the lack of ability to fully manipulate and 
extract full information from the physical system, i.e. the control and estimation of 
the quantum states and channels, called quantum processes. This latter problem has 
to be handled by the formalism of the modern methods of system- and control theory. 
It is certainly not a trivial problem, because in quantum mechanics, one has to face 
the difficulty of the threatment of measurements. Namely, that no measurement can 
be carried out on a quantum system without disturbing the state of the system itself. 

The task of the estimation of quantum channels — commonly known as quantum 
process tomography (QPT) [1] — got a significant attention over about the last ten 
years. It is undoubtedly a fundamental problem of quantum information theory, as 
it has considerable relevance not only in quantum computers, but also in the field of 
quantum communication and cryptography. For example, quantum communication 
channels usually rely on a priori knowledge of the channel properties. 

The problem of quantum process tomography was investigated by several authors 
[H 121 E]. The work j3] gives a comprehensive survey on the different strategies used 
for process tomography (or channel estimation). The problem can essentially be 
formulated in two type of methods: direct, and indirect. In the indirect method, we 
trace the problem back to quantum state tomography, i.e. the information about the 
unknown quantum channel is obtained by sending known probe quantum systems 
through the channel, and performing state tomography on the output states. In 
contrast, in the direct method, the experiments directly give information about the 
channel, without the need for a state tomography step. 

From a methodological point of view, there are two principally different ap- 
proaches to the problem of quantum tomography, the statistical approach and the 
convex optimization based approach j2]. The former gives information on the statis- 
tics of the estimate and on its covariance matrix, but it has the drawback, that it 
is hard to compute in higher dimensions. In spite of this, majority of the existing 
methods belong to this category. 

In contrast to this, an optimization based method does not give as much infor- 
mation, but it is relatively easy to compute. This approach has been pursued in the 
work [5] where the problem of channel estimation (in the form of the Choi matrix) 
is considered, assuming a completely general channel, thus without any assumption 
on the inner structure of the Choi matrix. The author uses random input states, 
and random measurements on the output, and formulates a maximum likelihood 
problem. A similar method is used in the work jS], which formulates the task of pro- 
cess tomography as a least squares problem, which is convex. It also uses the Choi 
matrix as optimization variable, thus searches the optimal channel in the convex set 
of all CPTP maps using multiple input-measurement pairs. 
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2. QPT AS AN OPTIMIZATION PROBLEM 



However, as it is stated in [2, it is a reasonable assumption to consider only a 
certain family of channels given with a model, based on a priori knowledge about 
the structure of the channel. The aim of this work is to develop a method which 
is capable of incorporating these constraints into the channel tomography problem, 
while still remaining - at least partially in the general case - solvable by convex 
optimization. 

The accuracy of the estimation is measured with the empirical covariance matrix. 

The rest of this paper is organised as follows. In Section |2] the formal description 
of quantum process tomography is given, and a least squares objective is derived for 
the solution. In Section [3] the variables and constraints of the convex optimization 
problem are considered, and the extraction of the channel parameters is discussed. In 
Section H] the procedure is demonstrated on several case studies. Finally, in Section 
[5] the conclusions are given, and the direction of further work is discussed. 

2 Quantum process tomography as an optimization 
problem 

In this work, the indirect procedure of process tomography is followed, mostly 
relying on the work [H]. The formal mathematical description of process tomography, 
i.e. the so called tomography configuration contains the following elements: 

• A known input density operator p on the Hilbert space T-L of the system. 

• The unknown quantum channel S : /^(H) — ?■ CiTi), which is to be estimated. 
The channel can be written for example in the Kraus representation, in which 
case the output of £^ is cr = S{p) = '^j ^iP^l, where the Ej operators are the 
operator elements of the channel S. These must satisfy the trace preserving 
constraint E^Ej = i, where i is the identity operator 

• A set of POVMs. A POVM M = {M„} is a set of positive operators, with 
which we can perform quantum measurement on the channel output state a. 
In order to be able to uniquely identify the channel output state, an important 
requirement in quantum process tomography is that the measurements must 
be tomographically complete. This means that the measured POVMs must 
provide all the information about the output state, and thus on the channel. 
Such a set of measurement operators is sometimes called a quorum. 

Note that we can use multiple different tomography configurations, i.e. different 
input states and POVMs in order to achieve better estimation on S. In this work, 
the input-POVM pair corresponding to the 7th configuration is denoted by p^ and 

2.1 Data collection 

The first stage of process tomography is the collection of the measurement data 
into a measurement record. The measurements are performed in each 7 configuration 

^Here ^ denotes the adjoint of the operator. 
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2. QPT AS AN OPTIMIZATION PROBLEM 



2.2. Least Squares estimation 



Pi 





Outcome counts 



{ 



^Q;,^7 trictls ^ 



Figure 1: The scheme of data collection for process tomography. 



times independently. This scheme can be seen in Figure [H 

During data collection, the different a outcomes of the measurements in the 
configuration 7 are counted in the variable 0^,7, and put in the measurement record 
D. Then obviously 

^ ^ C(^,7 ^7 • 
a 

Thus, we have to perform a total number of ntot = ''^7 independent measure- 
ments. The estimator £ of the channel E will be calculated from these measurement 
outcomes, based on some estimation procedure. 



2.2 Least Squares estimation 

The next step of the tomography problem is to choose a suitable estimation pro- 
cedure. The least squares is a popular method, because it is easy to implement, and 
it results in an estimation procedure, which can be used in higher dimensions, and 
for multiple parameters. So in this work it is used for quantum process estimation. 
The exact form of the LS objective function for the process tomography problem is 
derived in the following. 

If we write the cr^ = S{pj) output of the channel in Kraus representation: 

i 

then the probability density function of the measurement results can be modeled by 
p,,^ = Tr (fT^M„,^) = Tr eJm,,^E,p^ j . (1) 

In this expression, the optimization variables would be the Kraus operator ele- 
ments Ej. However, as these do not form a convex set, the resulting LS optimization 
problem would be nonconvex. To overcome this difficulty, we can choose the Choi 
matrix as optimization variable. 

A common definition of the Choi matrix is through the use of the Jamiolkowsky 
isomorphism [S]. Let "Hi and be Hilbert spaces, and let A be a bounded linear 
operator in C{'Hi,'H2)- Then we can associate A with a vector in the Hilbert-space 

\A))■.= J2M^l)\^2) , 
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2.2. Least SquEires estimation 



2. QPT AS AN OPTIMIZATION PROBLEM 



where and {|^2)} are the respective bases of the spaces Tii and 7^211 By this, 

noting that in our case "Hi = 7^2 = "H is the Hilbert space of the input state, the 
Choi matrix of the channel S will be 

:=5^|E,))((E,| , 

k 

where the vector |Efc)) in 7/ ® 7/ is associated to the operator E^. The condition of 
complete positivity can then be written as 



and the channel is trace preserving if 

The scalar product of these "double-ket" vectors is defined in a natural way: 

((A|B)):=Tr(AtB) , (2) 

what is by definition the Hilbert-Schmidt scalar product of the two operators. We 
can also identify a useful rule easiljH: 



(A®B)|C)) = |BCA^ 



(3) 



From the above we see that an isomorphism can be made between the density 
operators on and the pure states of the space "H (>>> "H. 

Now, using the above, we can continue the derivation of ([1]). By the use of 
relations ([2]) and we get 



Pa,7 = ^Tr(^E|Mc,,T,Ej/9^^ 

i 

= 5^((Mt,,^E,|E,p,)) = 

i 

= 5^((E,|(i^Mt^,^)(p>i)|E,)) 



(p>Mt,,^)|E,))((E, 



— Tr(RQ,_^X£:) , 

where X^- is the Choi matrix of the channel S, and the operator Ka.-y depends on 
the channel input p and on the measured POVM elements in configuration 7. 

Proceeding to the derivation of the LS objective function, a statistics for esti- 
mating the probability be given by the 



(4) 



^The |.)) notation indicates tliat tliese vectors represent operators. 
■^Here ^ denotes tlie transposition. 
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3. ESTIMATION OF CHANNEL MODEL FAMILIES 



relative frequency that can be calculated from the measurement results. The variance 
of this unbiased estimate after independent measurements is known to be 

Var(j5^(a)) = — ^^.^(l - ^0,7) , 

because p^{a) has a binomial distribution. These show that for large n^, p^{a) — )■ p^,^ 
and Var(p^(a)) tends to as — )■ cxd, so is a reasonable unbiased estimate 

of the real value Pa,-)- This leads to formulating the parameter estimation problem 
as the following least squares objective: 

argmin Vi^^iXe) = V Tr(P-;P - R„,^X^)' , (5) 
so that X^>0, Tr^(X^) = i 

where the diagonal matrix P^™'' = P-yi.'^) ds^y^y;) is defined for simplification pur- 
pose. 

This problem is a convex optimization problem in the Choi matrix X^:, thus it 
can be solved relatively easily using existing numerical algorithms [HI [TO] . 



3 Estimation of channel model families 

It is reasonable to assume that we know a model type of the channel, and only 
the unknown values of the parameters of this model have to be estimated. In such 
a problem, the above derived least squares objective cannot be used directly, as it 
assumes a completely general channel model, and estimates the elements of the Choi 
matrix. Thus, if the task is to estimate some specific model parameters, this method 
can suffer significantly from overparametrization. 

As a possible solution, we can study the internal structure of the Choi matrix, 
and use this information to select more appropriate, model specific parameters for 
optimization. Effectively, this should reduce the set of optimal solutions of problem 
([5]) to solutions, which are consistent with the desired model family. 



3.1 The complex LMI constraint 

The natural choice would be to select just the unknown channel parameters, 
however it can be easily seen, that this choice would ruin convexity, as the Choi 
matrix can be an arbitrarily nonconvex function of these in the most general case. 

Thus, instead of this, the following method is used. Let hi{6), . . . , hm{0) denote 
functions of the channel parameters, and let Hq, Hi, ... , denote constant Her- 
mitian matrices. Then we can expand the Choi matrix as an affine function: 

Xe = J2Hkhk{e) + Ho (6) 

k 

It can be seen, that if we consider the functions hk{0) as optimization variables, 
we can replace the completely unknown matrix variable X^ in problem ([5]) with a 
matrix, which has a known structure up to an affine approximation. 
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3. ESTIMATION OF CHANNEL MODEL FAMILIES 



Thus, the positivity constraint in ([5j) turns into a complex LMI constraint, and 
the trace preserving constraint can in turn be omitted, as if an exact channel model 
is known, it can always be taken into account in the model construction. 

3.2 Convex constraints 

It can occur, that some of the functions hk{6) depend on one or more other 
functions. If this relation / is convex, then we can use it to define additional con- 
straints to make the estimation more accurate. We can determine a minimal set 
V = {hi{9)}, where I & ly. The set Jy contains the indexes of those functions among 
hk{d) [k = 1, . . . , m), which do not depend on each other through a convex relation, 
thus are independent in this sense. These can in general be written as 

hk{0) = r™(/iiW, . . . , KAO)), hi{e) G V, / = 1, . . . ,m' 

Thus, if we have convex relations between the functions hk{0) chosen as optimization 
variables, then the identification of these relations can help to reduce the number of 
these variables, which can be helpful in the parameter extraction step (see subsection 

m). 

However, these relations cannot be added simply as equations to the problem 
because a strictly convex function with equality is not a convex constraint. In 
these cases we can relax the equality, and add the convex constraint to the problem 
as inequality. 

If ric such constraints are found, then we obtain the following optimization prob- 
lem (omitting the trace preserving constraint): 

argmin Vts(Xf) , 
h(e) 

so that X£- > , (7) 
and fr{hi{e),...,h„Ad)) <hk{0), k = l,...,n. 

After solving this problem we get an optimal value = Vi^sO^s) ^^^^ objective, 
where the * denotes the optimality. Then we formulate for each k the following 
auxiliary problem: 

argmin hk(6) , 
h{e) 

so that > , (8) 
and fr{hi{0),...,hm'{e))<hk{e), k = l,...,nc, 
and Vk = VlsO^s) < t'Li 

By solving these for each k in succession, we make sure that the additional convex 
constraints in ([7]) get as close to equality as possible, thus making the parameter 
estimation procedure more accurate. However it would require further studies to see 
whether this method always guarantees a global optimum in the original objective, 
and whether it is reachable in finite steps. In practice, the effectiveness of this heuris- 
tics is based only on the fact, that in case of the optimal solution (in the model of 
the real channel), all of the convex constraints are saturated. 
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4. CASE STUDIES 



3.3. Determining the model parameters 



3.3 Determining the model parameters 

After we performed all the needed optimization steps and obtained an optimal 
Choi matrix together with the optimal variables hi, the next step of parameter 
estimation is to fit hi to the predefined optimization variables hk{6). In essence, this 
problem can be written as 



where the norm is arbitrary. 

Of course, only those parameters has to be used here which are in the set V 
of independent variables. Using the Euclidean norm, this problem thus can be re- 
duced to the following. Let Jy denote the index set of the independent optimization 
variables: 



This is a nonlinear least squares problem, which is not even convex in the general 
case, because of the hi{6) functions. If however these functions are convex, then the 
problem ([9]) will be convex too. 

Note that the number of independent optimization variables determines the com- 
plexity of this nonlinear least squares problem, as having fewer variables result in 
fewer nonlinear equations. 

4 Case studies 

The aim of the simulation experiments was to analyze the effect of additional 
constraints on performance of the numerical optimization based estimation of quan- 
tum channels. Results were generated in MATLAB environment, using simulated 
random measurement data. The optimization problem ([7]) and ([8]) were solved using 
YALMIP modeling language [11] and the SDPT3 solver [12]. The extraction of the 
model parameters, i.e. the solving of ([9]) was in general done by the built in nonlin- 
ear least squares solver of MATLAB, solving it multiple times using random initial 
conditions to achieve a global minimum with high probability. 

4.1 Tomography configurations 

The experiments were set up as follows. 

• The used input states were all pure states. 

• To obtain a tomographically complete measurement, appropriate POVMs were 
selected on the Hilbert space of the system, and were used for measurement. 
In two dimensions, these are the Pauli matrices, and observables with simi- 
lar properties in higher dimensions. Each of these can be decomposed into a 
POVM, which can be used in one configuration. Note that the tomographically 
complete measurement in this setting may not require an operator basis of ob- 
servables to be used. The knowledge about the channel structure may allow us 
to identify parts of the state space where the effect of the channel is the same, 



argmin ||h* — h(^)|| , 




(9) 



laiv 
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4.2. The generalized amplitude damping channel 



4. CASE STUDIES 



thus requiring only less observables (see case study in subsection 14. 4p . The 
only important thing is to get information on all of the channel parameters. 

• The total ritot number of measurements was distributed among all the config- 
urations equally, i.e. for each configuration 7, an equal number of experiment 
were used. 

Each experiment setup was repeated five times and their average was taken. Each 
of the estimated process Choi matrices and channel parameters were analyzed 
using the following four estimation performance measuring quantities: 

• Fidelity of the estimated and ideal channel output: £(p)). The fidelity 
of two quantum states is defined by 



a) = Try p^ap'^ 



The empirical mean 6 of the estimated parameters 6. 

The empirical covariance matrix of the estimated parameters 6. This could be 
used only in cases, where the empirical covariance could be made diagonal with 
appropriately selected measurement observables. The reason for this is that in 
other cases, the estimated parameters were correlated. One way to obtain a 
diagonal empirical covariance is, that the output state parametrization has to 
be compatible with the channel parametrization in the sense that each state 
parameter is affected only by one channel parameter. In this case we can use 
that set of operators for measurement in which the state parametrization is 
given, and through this compatibility, we will be able to estimate the channel 
parameters independently |13| . 



Thus, only the diagonal elements of the covariance matrix, i.e. the variances 
of the parameters were computed: 



Var(^,) 



1 ^ 



The Hilbert-Schmidt norm of the estimation error: IIX — X| 



4.2 The generalized amplitude damping channel 

The Choi matrix of the generalized amplitude damping channel is 

1 — 7 + ^7 ^/T^^ 
7 — p7 



X. 







p7 
-P7 + 1. 
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4. CASE STUDIES 



4.2. The generalized amplitude damping channel 



Here the parameter 7 gives the strength of the noise effect in the interval [0, 1], and 
p parameterizes the output state for 7 = 1 also in the interval [0, 1]. This is |0) when 
p = 1, and |1) when p = in the computational basis. 

the above Choi matrix can be decomposed in the following way: 



Using 



Ho 



1000 
0000 
0000 
0001 
1000 
0-100 
0010 
000-1 



Hi 



-1000 

0100 

0000 

0000 

0001" 

0000 

0000 

1000 



and the optimization variables will be 

hi = 7, /i2 



PI, 



'1-7- 



Notice that we can find a convex relation between these variables, thus we can 
further reduce their number: 

hl-l = -hi 

If we now put these into the problem ([7j), we get the following optimization 
problem: 



arg mm Vn 
h(0) 



so that X^- > 0, 
1 < -hi 



(10) 



and hi 



The type ([S]) auxiliary problem will be: 



argmin —hi , 

h(0) 



so that X^- > 0, 
and h^ — 1 < —hi , 
and VLsiX£)<v; 



After solving these, the final step is the parameter extraction using (jH]). As the 
independent set V = {/ii,/i2} of optimization variables contains nonconvex func- 
tions, this problem will also be nonconvex. However, in this simple two dimensional 
case, we can see that the solution of (|9]) will be explicitly 



1 = hi, P 



hi 



The input state used in this example was the pure state with the Bloch vector 
^[1,1, 1]""". The applied observables were the Pauli matrices 



CTl 



" 


1 " 




" -i " 




" 1 " 


1 





, 0-2 = 


i 


, 0-3 = 


-1 
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4. CASE STUDIES 
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Figure 2: The parameters are 7 = 0.7 and p = 0.3. (a) shows the fidehty of the estimated and ideal 
channel output, [(b) [ shows the expected value of the estimated parameters, 



shows the variance 



of the estimated parameters, and |(d)| shows the Hilbert-Schmidt norm of the estimation error. 



These form a tomographically complete set of observables for the two dimensional 
case. Each was used as a two-element POVM in a different configuration, thus a total 
of three configurations were used. Unfortunately, this set does not allow the inde- 
pencent estimation of the parameters for this channel, thus the empirical covariance 
matrix cannot be used in this CcLSG clS performance indicator. 

The resulting estimations, and the used characteristic quantities are plotted for 
three different pairs of exact parameter values in Figures [2}|H The chosen (7, p) 
pairs were (0.7,0.3), (0,1) and (1,0). These values were selected to represent both 
the interior and border of the parameter space. 

From the results, we can conclude that the estimated parameters are very accu- 
rate, even on the border of the parameter domain. 

4.3 T-representation of qubit Pauli channels 

In the two dimensional case, we have the possibility to expand the quantum 
channels in the Pauli basis, thus giving the affine map of the channel, which takes 
Bloch vectors to Bloch vectors |14j : 

"1 " 

where T3 can be assumed diagonal with elements a, /3, and 7 without loss of gener- 
ality. 
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4.3. T-representation of qubit Pauli channels 
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Figure 3: The parameters are 7 = and p = 1. As p drops out from the Choi matrix, it cannot be 
estimated, (a) shows the fideUty of the estimated and ideal channel output, [(b) [ shows the expected 
value of the estimated parameters, (c) shows the variance of the estimated parameters, and |(d) 
shows the Hilbert-Schmidt norm of the estimation error. 
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Figure 4: The parameters are 7=1 and p = 0. This is an estimation on the border of the parameter 



domain, (a) shows the fidelity of the estimated and ideal channel output, |(b)| shows the expected 



value of the estimated parameters, (c) shows the variance of the estimated parameters, and |(d) 
shows the Hilbert-Schmidt norm of the estimation error. 
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4. CASE STUDIES 



If we restrict ourselves to Pauli channels, i.e. we set t = 0, then the Choi matrix 
of a quantum channel in this T-parametrization is 



1+7 
2 








a+l3 - 
2 





1-7 
2 


2 








a-(S 
2 


1-7 
2 





a+l3 
2 








1+7 
2 . 



Here, the allowed values for the three parameters a, /3 and 7 for the channel to send 
density matrices to density matrices are 

-l<a,/3,7<l , 
and the positivity constraint is satisfied if and only if 

|1 ±7| > |a±/3| . 

Using ([6]), the Choi matrix of this channel can be decomposed in the following 
way: 

1 ■ 
0010 
0100' 
1000 
10 00" 
0-100 
00-10' 
0001 

and the optimization variables will be 

hi = a, /i2 = /3, /i3 = 7 • 

There are no convex relations in this case, so only the solution of is required. 

The fact, that the optimization variables h^. are exactly the channel parame- 
ters themselves, makes the parameter extraction step (Q also unnecessary for this 
channel. Thus, we can see that in this representation, the estimation of any two 
dimensional Pauli channel is a convex problem. 

The input state and the POVMs were the same in this experiment as in sub- 
section 14. 2^ so the same three configurations were used. Note however, that in this 
example, the set of observables are not only tomographically complete, but also 
make it possible to estimate each channel parameter independently, resulting in a 
diagonal empirical covariance matrix. 

The resulting estimations, and characteristic quantities are plotted for three dif- 
ferent sets of exact parameter values in Figures OEl The chosen [a, /3, 7) vectors 
were (0.3, -0.1, 0.1), (0.5, 0.3, 0.2) and (0.4, 0.1, -0.5). These values were selected to 
represent both the interior and border of the parameter space. 

As we can see, the parameter estimation in this convex case is very simple and 
accurate. 
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4.3. T-representation of qubit Pauli channels 
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Figure 5: The parameters are a ~ 0.3, /3 = —0.1 and 7 = 0.1. (a) shows the fidelity of the estimated 
and ideal channel output , [(b) | shows the expected value of the estimated parameters, (c) shows the 
variance of the estimated parameters, and |(d)| shows the Hilbert-Schmidt norm of the estimation 
error. 
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Figure 6: The parameters are a — 0.5, /3 ~ 0.3 and 7 = 0.2. (a) shows the fidelity of the estimated 
and ideal channel output , [(b) | shows the expected value of the estimated parameters, (c) shows the 
variance of the estimated parameters, and |(d)| shows the Hilbert-Schmidt norm of the estimation 
error. 
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Figure 7: The parameters are a = 0.4, 13 — 0.1 and 7 — —0.5. (a) shows the fidelity of the estimated 
and ideal channel output, [(b)| shows the expected value of the estimated parameters, (c) shows the 
variance of the estimated parameters, and |(d)| shows the Hilbert-Schmidt norm of the estimation 
error. 



4.4 The generalized Pauli channel in 3 dimension 



The generalized Pauli channel is discussed in |13] . It is a generalization of the 
Pauli channels for qubits into higher dimensions. Its definition for a general Hermi- 
tian operator A taken from the operator algebra C{'H): 

£{A) = ( 1 - E ^] + E ^^^^(A) ' 

V i=l / 1=1 

where d is the dimension of the input quantum system, and u is the number of com- 
plementary subalgebra^oi C{'H) used in the channel construction. Let Ai, . . . , Ai^ be 
pairwise complementary subalgebras, which linearly span the whole algebra CiTi). 
Then Ei is the orthogonal projection with respect to the Hilbert-Schmidt inner 
product onto Ai. 

Here, we consider the case when all of the complementary subalgebras are max- 
imal Abelian. In this case the set of subalgebras correspond to a set of orthonormal 
bases called mutually unbiased bases (MUB). Two orthonormal bases and 
{|V'2,j)} are mutually unbiased if and only if the maximal Abelian subalgebras Ai 
and A2 containing operators diagonal in the bases {iV'i.j)} ^.nd {|'02,j)} are comple- 
ment aryU 



"^Two subalgebras Ai and A2 of C{H) are called complementary if the traceless subspaces of 
Ai and A2 are orthogonal with respect to the Hilbert-Schmidt inner product |15| . 

^By another definition, two orthonormal bases and {|V'2j)} are mutually unbiased if for 
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4.4. The generalized Pauli channel in 3 dimension 



Let {IV^ij)}, ■ ■ ■ , ili^u.j)} be u sets of bases which are pairwise mutually unbiased. 
Then the orthogonal projection Ei can be constructed as 

d 

i=i 

The effect of the generalized Pauli channel is thus the following. It projects the 
input state onto each subalgebra Ai, then scales the results with the corresponding 
channel parameter Aj. Trivially, if Aj = for all i, then the result is the completely 
mixed state ^1. In contrast, if Aj = 1 for all i, then we get back the input state as 
expected. 

The channel is trace preserving by construction, and completely positive if and 
only if the parameters Aj satisfy 

l + rfAj> J]A,>^, (11) 
j 

A set of MUB can be constructed, for example, as follows [T3]. Let \ifo)^ • ■ ■ > \ fd-i) 
be a basis. Let X and Z be unitary operators such that 

^I'^i) = IV'i+l mod(d-l)) , 

Then the unitaries Sj^k '■= Z^X^ {j, k = 0, . . . , d — 1) are pairwise orthogonal with 
respect to the Hilbert-Schmidt inner product. If we select the matrices X, Z, and 
also each one in the form ZX^, k = 1, . . . , d — 1, then the eigenvectors of each of 
these unitaries form a set of MUB. Moreover, if d is prime, then this procedure 
results in exactly d + 1 MUB, which is the maximum number of MUB possible. 

In the present example, we choose d = 3, thus the used MUB will be the bases 
obtained using the eigenvectors of the matrices X, Z, ZX, ZX^. Then the Choi 
matrix of the channel will be the following: 
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all fc, ^ = 1, . . . , d they satisfy |('0i,fc|^2,;)P = 2' In dimensions, the maximal number of pairwise 
mutually unbiased bases is c?+ 1. Currently, this maximal number of MUB are known to exist only 
in dimensions which are prime power |16| . 
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where 

/i = 1 + 2A2, 

/2 = 1 - A2, 
/s = Ai + A3 + A4, 

/4 = Ai-^(l + iv^) 

Note that the n = 2 case using the same MUB selection method is exactly the same 
as the T-representation of Pauli channels in subsection 14.31 

The decomposition (|6]) of the Choi matrix can be calculated easily based on the 
previous examples, given that the optimization variables are exactly the channel 
parameters: 

hi = Xi, i = l,...,4 

This, and the construction of the channel shows that the problem of parameter 
estimation is convex, and solvable using only in any such - arbitrarily high - 
dimension, in which the channel itself can be defined. 

The input state used in this experiment was the pure state 

I*) = ^(lV^l,l) + 1^^2,1) + 1^3,1) + 1^^4,1)) 

which is constructed from the first vector of each basis of the MUB, on the 

analogy of the Bloch vector 1, 1]^ in the two dimensional case. Based on this 
analogy, this state is expected to be sufficient for the characterisation of the channel 
parameters. 

As we have discussed before, each channel parameter Aj affects the length of the 
projected input state Ei{p) in the subalgebra Ai independently. The channel has 
no other effect. Thus, we can measure the effect of the channel by focusing only on 
the subalgebras, and estimate the parameters independently. For this purpose, any 
Hermitian operator belonging to the subalgebra Ai can be considered as observable, 
and can be used for the estimation of Aj. The reason for this is that the effect of the 
channel is the same on the whole subalgebra, so it is enough to measure any direction 
inside Ai to get information on Aj. Moreover, the independence of the measurements 
on different subalgebras result in a diagonal covariance matrix. 

For example, valid observables for each subalgebra can be constructed using the 
MUB in the following way: 

d 

This way we get observables with d different eigenvalues. Beyond that, the specific 
value of the eigenvalues are irrelevant, as we are interested only in the outcome 
probabilities. 

As it can be seen from the above, a total of four configurations were used in 
the experiments. The resulting estimations, and characteristic quantities are plot- 
ted for three different sets of exact parameter values in Figures IHlflOl The chosen 



-^(l-iV3), 
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Figure 8: Ai = 0, A2 = 0.15, A3 = 0.3 and A4 — 0.5. (a) shows the fideUty of the estimated and ideal 
channel output, [(b) [ shows the expected value of the estimated parameters, (c) shows the variance 
of the estimated parameters, and |(d)| shows the Hilbert-Schmidt norm of the estimation error. 



(Ai, A2, A3, A4) vectors were (0,0.15,0.3,0.5), (0.1,0.3,0.4,0.5) and the third was 
(—0.3, —0.2, —0.1, 0.1). These values were selected to represent both the interior 
and border of the parameter space. 

We can see that in the case of a qutrit channel with a high number of parameters, 
this method can provide very accurate parameter estimation, even on the border of 
the parameter domain. In some of the images, the Hilbert-Schmidt norm does not 
seem to decrease over the given range of the measurement number rij. Results from 
experiments performed with higher measurement numbers show that the reason of 
this is slower convergence. If measured this way, the desired estimation error is 
reached with a greater number of measurements in higher dimensions, due to the 
higher number of Choi matrix elements. 



5 Conclusion 

In this work, we introduced a method for the parameter estimation of quantum 
channel model families, based on convex optimization. 

Using simulation case studies it has been shown, that the proposed method 
of affine decomposition of the Choi matrix can significantly increase the accuracy 
of the parameter estimation for the case of a known channel model structure. In 
addition, with performing auxiliary optimization problems, convex relations between 
optimization variables can also be exploited to improve the estimation. 

In some of the cases, particularly that of Pauli channels, this method results 
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Figure 9: Ai = 0.1, A2 = 0.3, A3 = 0.4 and A4 = 0.5. These saturate the first inequahty m (|lip . 
so this is a border point of the parameter space, (a) shows the fidehty of the estimated and ideal 
channel output, [(b) [ shows the expected value of the estimated parameters, (c) shows the variance 
of the estimated parameters, and |(d)| shows the Hilbert-Schmidt norm of the estimation error. 
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-0.2, A3 = —0.1 and A4 = 0.1. These saturate the second inequality 



in pT|) . so this is again a border point, (a) shows the fidelity of the estimated and ideal channel 



output, [(b) I shows the expected value of the estimated parameters, (c) shows the variance of the 
estimated parameters, and |(d)| shows the Hilbert-Schmidt norm of the estimation error. 
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in a purely convex optimization problem, thus we can obtain a globally optimal 
estimation with relatively simple numerical algorithms. In the general case however, 
beside the convex part, the method may need a nonconvex optimization step as well. 
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